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Abstract 

Flagellated baeteria, sueh as Escherichia coli, perform direeted motion in gradients of eoneentration of 
attraetants and repellents in a proeess ealled ehemotaxis. The E. coli ehemotaxis signaling pathway is a 
model for signal transduetion, but it has unique features. We demonstrate that the need for fast signaling 
neeessitates high abundances of the proteins involved in this pathway. We show that further constraints 
on the abundances of chemotaxis proteins arise from the requirements of self-assembly, both of flagellar 
motors and of chemoreceptor arrays. All these constraints are specific to chemotaxis, and published data 
confirm that chemotaxis proteins tend to be more highly expressed than their homologs in other path¬ 
ways. Employing a chemotaxis pathway model, we show that the gain of the pathway at the level of the 
response regulator CheY increases with overall chemotaxis protein abundances. This may explain why, 
at least in one E. coli strain, the abundance of all chemotaxis proteins is higher in media with lower nutri¬ 
ent content. We also demonstrate that the E. coli chemotaxis pathway is particularly robust to abundance 
variations of the motor protein FliM. 
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Introduction 


Flagellated baeteria sueh as Escherichia coli are able to move up eoneentration gradients of ehemieal 
attraetants, and down gradients of repellents, in a proeess ealled ehemotaxis 111). The motion of these 
baeteria eomprises periods of straight swimming ealled “runs”, and random ehanges of direetion ealled 
“tumbles”. Run lengths are modulated to yield a three-dimensional random walk biased toward the pre¬ 
ferred direetion |2). Runs oeeur when flagella rotate eountereloekwise and bundle together, while tumbles 
oeeur when one or more rotate eloekwise and disrupt the bundle |^. 

In E. coli, transmembrane ehemoreeeptors form large and highly ordered arrays at the eell poles. 
Chemore(^tors are organized into trimers of dimers, and linked by CheW and CheA into a honeyeomb 
lattiee |3-0)> with a 6:1:1 reoeptor:CheA:CheW stoiehiometry in terms of monomers |3). Reeeptors eon- 
trol the aetivity of the histidine kinase CheA, whieh phosphorylates the eytoplasmie response-regulator 
protein CheY. Phosphorylated CheY (CheY-P) binds to FliM in the flagellar motor to induee eloek- 
wise rotation and tumbles. CheA also phosphorylates and aetivates CheB, a deaminase/methylesterase, 
that together with the methyltransferase CheR, reversibly modifies speeifie residues on the reeeptors to 
produee adaptation, i.e., to return to a baseline aetivity level when ehemoeffeetor eoneentrations stay eon- 
stant flJ3) . Upon an inerease in the eoneentration of ehemoattraetant, the aetivity of CheA deereases, 
whieh leads to fewer tumbles. Conversely, upon a deerease in the eoneentration of ehemoattraetant, the 
aetivity of CheA inereases, yielding more tumbles. This biases the eell’s motion toward elimbing the 
gradient of ehemoattraetant. 

E. coli ehemotaxis is a model for signal transduetion, and is a member of the family of two- 
eomponent signalingsystems that enable baeteria to sense and respond to various features of their 


environment (l9l. Illl.ll2h. However, the ehemotaxis pathway has unique features. First, ehemotaxis ealls 


for very fast response times. We demonstrate that this requirement neeessitates high abundanees of 
ehemotaxis proteins. Seeond, ehemotaxis involves large-seale multi-protein eomplexes, namely flagellar 
motors (13,113) and ehemoreeeptor arrays lifl)- We show that the eonsequent self-assembly require¬ 
ments impose additional eonstraints on the abundanees of ehemotaxis proteins. Beeause of these speeifie 
eonstraints, we hypothesize that ehemotaxis proteins will be more highly expressed than their homologs 
in other pathways. Published data are eonsistent with this predietion, but more data would be required 
to definitively eonfirm it. In addition, using a model of the ehemotaxis pathway, we show that the gain 
of the ehemotaxis pathway at the level of CheY-P inereases with overall ehemotaxis protein abundanees. 
This is eonsistent with the faet that artifieially overexpressing ehemotaxis proteins in a eoneerted manner 
inereases ehemotaetie effieieney, measured by a swarm assay Q. Moreover, it may help explain why 
the abundanee of all the ehemotaxis proteins ean be up to nine-fold higher in nutrient-poor versus rieh 
medium (16). We also demonstrate that the pathway is partieularly robust to abundanee variations of the 
motor protein FliM, in line with other robustness features of the ehemotaxis pathway (15. 17-19). 
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Models and methods 

Chemotaxis pathway model 

We model the E. coli ehemotaxis signaling pathway by the following system of ordinary differential 
equations for the average eellular eoneentrations of eaeh protein in the pathway: 


[CheAjtot = [CheA] + [CheA-P], 

[CheYjtot = [CheY] + [CheY-P] + [FliM ■ CheY-P] + [CheZ • CheY-P], 
[FliMjtot = [FliM] + [FliM • CheY-P], 

[CheZ] tot = [CheZ] + [CheZ • CheY-P], 

[CheB]tot = [CheB] + [CheB-P], 

^ "^c1t[CheA] - [CheA-P][CheY] + /cf [CheB]), 

= kl [CheA-P] [CheY] - [CheY-P] (fcf [CheZ] + k^[F\M\ + kl) 
+ A:|[CheZ • CheY-P] + [FliM • CheY-P], 

+ A;^)[FliM ■ CheY-P] - [CheY-P] [FliM], 

= (fcf + fcfat)[CheZ ■ CheY-P] - fcf [CheZ] [CheY-P], 

(i[CheB-P] ^ [CheA-P] [CheB] - fcf [CheB-P]. 


( 1 ) 

( 2 ) 

( 3 ) 

( 4 ) 

( 5 ) 

( 6 ) 


( 7 ) 

( 8 ) 
( 9 ) 

( 10 ) 


Here, eoneentrations are denoted by square braekets, and total eoneentrations by “tot”. Phosphorylated 
speeies are denoted by “-P”, and eomplexes by a dot between the two speeies names (e.g., FliM-CheY-P). 
The first five equations express eonservation of matter for eaeh protein, while the other ones eonvey the 
kineties of the ehemieal reaetions in the pathway. These reaetions are depieted in Eqs. [STHSSl of the 
Supporting Material. 

We focus on the adapted state of the pathway and on its initial response to attractant or repellent, 
without explicitly mode ling the slower dynamics of adaptation. In the adapted state, the active fraction a 
of CheA is modeled as (12011: 


fc«[CheR]tot 

[CheR]tot + A:f,t[CheB-P] ’ 


( 11 ) 


which follows if CheR methylates inactive receptors and CheB-P demethylates active receptors. This 
active fraction is taken into account in the system of differential equations in Eqs. [TlfT^ throu gh t he 
reduction of the time-averaged autocatalytic rate of CheA from to ak^^^ (see Eq.[^, as in Ref. (12 lb . 


Parameter values. We use experimentally-determined values for the reaction rates k in Eqs.fTIfTTl except 
for and (Table ISTT) . Indeed, while the reaction rates for CheY-P dephosphorylation by CheZ have 
been measured in vitro in the absence of CheA (l23l . it is known that CheZ binds to CheA-short, a 
translational variant of CheA that cannot autophosphorylate, and that this binding significantly activates 
CheZ (BQ. We thus adjusted /cf and in order to obtain a fraction of CheZ bound to CheY-P of 


30%, consistent with in vivo FRET measurements in the adapted steady state dH 


We use the average copy numbers of each chemotaxis protein per cell measured in Ref. (fl^ for 
strain RP437 in rich medium for all proteins but EliM, and those in Refs. (1271 12Sb for EliM, also in rich 
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medium (Fig.[T]). Importantly, the autoeatalytie rate of the histidine kinase CheA is inereased about 
100-fold when CheA is in eomplex with ehemoreeeptors and Che W (1291) . so only CheA in signaling 
eomplexes has signifieant kinase aetivity. Reeeptors are limiting dig 1301) for signaling eomplexes with 
a 6:1:1 reeeptor:CheA:CheW stoiehiometry (Q). Henee, we eonsider that the total number of CheA pro¬ 
teins per eell that ean be aetive (setting [CheAjtot) is one-sixth the total number of receptor monomers. 
It is also observed that less than 30% of FliM is found in complete flagellar motors (1271.1281. l3ll) . and 
that only 16% is in the soluble fraction dnt), while more than 25% of FliM [m 1), probably all the 
rest, is found in partially assembled structures (see Supporting Material). Isolated FliM molecules have 
a much lower affinity for CheY-P than FliM in motors, with a dissociation constant of 27 /xM (l32h versus 
3.5 fjM (l2lL l33U35h. which leads us to disregard isolated FliM. In the absence of any data to the contrary, 
we assume that CheY-P binds FliM in partly and fully-assembled motors with the same affinity. For each 
chemotaxis pathway protein, we derive the corresponding effective total cellular concentration using the 
standard E. coli cell volume of 1.4 fL ([H, 21) (Table IS2l). 


Numerical solution. We solve Eqs. fTIfTTIat steady state numerically using ‘NSolve’ (Wolfram Mathe- 
matica). The initial response to saturating attractant (or repellent) is obtained by abruptly decreasing the 
CheA active fraction a to 0 (or increasing it to 1) from its adapted value. Hence, we solve Eqs. [TlfT^ 
numerically with a = 0 (or 1), with the adapted concentrations as initial conditions, using ‘NDSolve’ 
(Wolfram Mathematica). 


Pathway gain. We are interested in the gain of the chemotaxis pathway. The input is the active fraction 
a of CheA, which directly depends on receptor states and hence on chemoeffector concentrations. We 
consider two different outputs: the concentration [CheY-P] of phosphorylated CheY, and the fraction ^ 
of EliM molecules bound to CheY-P, with corresponding gains defined by 


CcheY-P — 


A [CheY-P]/[CheY-P] 

Aa/a 


( 12 ) 


and 


_ A^/^ 
^ Aa/a 


(13) 


In practice, gains in the linear-response regime are computed for the pre-adaptation response to a 1% 
increase of the CheA active fraction a from its adapted value determined by Eq. [TTl 
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Figure 1: Schematic of the chemotaxis signaling pathway in E. coli. The number of copies per cell is indi¬ 
cated for each protein in the pathway. These numbers correspond to the measurements on strain RP437 
in rich medium in Ref. 113) for all proteins but FliM, and to the measurements in Refs. (E^ E^ for FliM, 
also in rich medium. 
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Results 


Fast response imposes constraints on the abundances of chemotaxis proteins 

Chemotactic trajectories are composed of straight “runs” and random changes of directions or “tumbles” 
(Fig.[2j4)- The mean run time of E. coli cells under adapted conditions is about one second (l36h . Hence, 
in practice, cells must make a decision whether to change direction in less than a second. The observed 
timescale of response to a saturating attractant is ~ 0.3 seconds 0). Fig. |2^ shows the timescales of 
the different molecular events involved in this response. The longest one is the dephosphorylation time 
~ 0.3 s of the cellular pool of CheY-P by the phosphatase CheZ (l2lb . Here, we show that this timescale 
implies lower bounds on the dissociation constant of FliM and CheY-P and on the abundances of several 
proteins in the chemotaxis pathway. 

The CheY-P molecules bound to FliM proteins need to unbind and to be dephosphorylated within this 
0.3 s for the pool of CheY-P to reflect the current chemoeffector concentration, thus ensuring an appro¬ 
priate response. The unbinding timescale is 1 /, where kf is the dissociation rate of FliM and CheY-P 
(see Eq.©, sol/k^ < 0.3 s implies kf > 3.3 /s. Since the binding of FliM and CheY-P is diffusion- 
limited, i.e. as fast as it can be, with a rate constant k^ = 5 /s /pM (12 iL 1421). the dissociation constant of 
FliM and CheY-P must satisfy = kf /kf > 0.7 pM. In reality, Kf = 3.5 pM (21, 33-35), and the 
associated unbinding timescale is 0.06 s. Hence, our lower bound on Kf is satisfied. 

In the adapted state, the fraction ip of FliM molecules that are bound to CheY-P should be in the inter¬ 
mediate range, in order to respond readily to both increases and decreases of the free CheY-P concentra¬ 
tion [CheY-P]. Assuming an adapted ip > 0.25, which is in the lower range of the region where the motor 
can switch rotation direction iill) . we obtain [CheY-P] > Kf /3 = 1.17 pM, and [FliM • CheY-P] > 
0.25 [FliM]tot = 0.35 /iM, where we used the total FliM concentration in TablelS2](see Methods and Mod¬ 
els). Hence, the total cellular concentration of CheY-P is CchsY-v = [CheY-P] -f [FliM-CheY-P] >1.5 /xM. 
Note that here we do not take into account the CheY-P that are bound to CheZ and thus essentially sure 
to be dephosphorylated (since kf^^ :» kf see Table |ST]). In practice, about 30% of CheY is phosphory- 
lated (136b . yielding (7cheY-p = 3 pM (using the total CheY concentration in Table [S2]). Hence, our lower 
bound on CchsY-v is satisfied, with the actual value being only twice as large. 

We now focus on the dephosphorylation of CheY-P, whose steady-state rate is (Eq. |9l) 


d[CheY-P] 


dt 


- -k^ 


cat 


[CheZ]t 


[CheY-P] 


dephos 


fcfat+fcf 


+ [CheY-P] 


(14) 


The whole cellular pool of non-CheZ-bound CheY-P, with concentration (7cheY-p> needs to be dephos¬ 
phorylated within 0.3 s. Using the minimal values of [CheY-P] and (ilcheY-p calculated above and rate 
constants in Table ISTl this requirement yields [CheZ]tot > 2.3 /iM. Note that using the experimental 
values for kf and fcf from Ref. (l22b . which disregard CheZ activation by CheA-short, gives a similar 


result: [CheZ]tot > 1-8 pM. Experiments yield [CheZ]tot = 3.8 pM (Table IS^. so here too, our lower 
bound is satisfied, with the actual value being less than twice as large. 

Eor turnover to occur within 0.3 s, ensuring that [CheY-P] reflects the current chemoeffector concen¬ 
tration, the whole cellular pool of non-CheZ-bound CheY-P also needs to be (re)phosphorylated within 
this time. Phosphotransfer from CheA-P to CheY being very fast, the limiting step is CheA autophos¬ 
phorylation (l43b . Hence, the steady-state CheY phosphorylation rate is simply Q;/c^jCheA]tot (Eq. [6]). 
Using the minimal values of (CcheY-p and of [CheZ]tot calculated above, and kf = 20 /s (Table ISTI). we 
obtain Q;[CheA]tot > 0.25 /xM. The total concentration of CheA in arrays (determined from the receptor 
concentration and the stoichiometry, see Models and methods and Table [S2l). [CheA]tot = 3.0 /xM, is 
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Figure 2: Response timescales in the chemotactic pathway. A. Schematic of a chemotactic trajectory: 
the bacterium swims straight during “runs” (lines with arrows), and randomly changes direction during 
“tumbles” (dots), resulting in a three-dimensional random walk. The mean run time under adapted con¬ 
ditions is about 1 s (136h . B. Schematic of the timescales involved in the initial (pre-adaptation) response 
to saturating attractant. The longest timescale corresponds to CheY-P dephosphorylation by CheZ (12It) : 
it is much longer than receptor switching motor switching (39), and unbinding of CheY-P from 

FliM (see main text), and slightly longer than CheY-P diffusion (^, ^). 


substantially larger than this lower bound. This hints at a low active fraction a, consistent with previous 
estimates, which range from a few percent (21,44) to about 30% (SB. 

Hence, the requirements of fast signaling impose lower bounds on the cellular concentrations of CheY- 
P, CheZ, and active CheA, as well as on the dissociation constant of CheY-P and FliM. These lower 
bounds are satisfied by experimental values, and are consistent with a low adapted CheA active fraction. 
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Pathway model accounts for observed concentrations and response times 

While the above simple arguments enabled us to derive eonstraints on the abundanees of ehemo- 
taxis proteins, a more detailed eomparison to observed eoneentrations and response times requires a 
mathematieal model. Here we present results from the pathway model given by Eqs. fTIfTTl Similar 
models have been produetively employed previously to investigate various aspeets of the ehemotaxis 
network (ll5l .ll8Ll2l[ |46l). Our foeus is on the impaet of protein abundanees on gain. 

The adapted steady-state of the ehemotaxis pathway is obtained by solving Eqs. fTIfTTl at steady-state 
with the parameter values in Tables fSTf and(see Models and methods). It yields [CheY-P] -f [EliM ■ 
CheY-P] = 3.0 /iM, i n ag reement with Ref. (l33h . and a proportion of phosphorylated CheY-P of 31%, in 
agreement with Ref. (l36h . Besides, we obtain a fraetion -0 of EliM moleeules that are bound to CheY-P of 
41% in the adapted state, whieh is in the funetional range where the flagellar motor is able to switeh (l2llb . 
We also obtain a fraetion a of aetive CheA of 25% in the adapted state, within the range of previous 
estimates (21, 2^ 44, 45). 

The initial (pre-adaptation) response of the pathway to instantaneous addition of saturating attraetant 
(or repellent) is obtained by solving Eqs. ITIfTOf with the adapted steady-state eoneentrations as initial 
eonditions, setting the CheA aetive fraetion a to 0 (or 1) (see Models and methods). Upon addition of 
attraetant, [CheY-P] is found to deerease to 0 with a half-time of 0.13 s, and the fraetion xjj of EliM pro¬ 
teins bound to CheY-P deereases with a half-time of 0.23 s (Pig. |3]). This is in reasonable agreement 
with Ref. 0), where the half-time for the deeay of CheY-P bound to EliM, observed experimentally by 
PRET, is 0.32 s. Note that the differenee between the timeseales obtained for [CheY-P] and for ^jJ from 
our pathway model indieates that the unbinding time of CheY-P from EliM is not negligible, eontrary to 
the usual assumption (l2lll) . Addition of repellent yields a faster response, with half-times of 0.07 s and 
0.08 s for the respeetive inereases of [CheY-P] and of (Pig. [3]). This is in reasonable agreement with the 
experimental value of 0.03 s for the half-time of the inerease of ^|J Q. Response to saturating repellent 
is faster beeause it relies on CheY phosphorylation by CheA, whieh is very fast when a = 1, while 
CheY-P dephosphorylation is limiting in response to attraetant (Pig. |3). 

The good agreement of the model with observations, obtained by adjusting only and fcf to mateh 
the fraetion of CheZ bound to CheY-P (see Models and methods), eneourages us to further study the 
model’s implieations. 
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Figure 3: Response to addition of saturating attractant or repellent, obtained from the pathway model 
(Eqs. [TlfT^. A. Concentration of free CheY-P ([CheY-P]) versus time after a step addition of saturat¬ 
ing attractant (blue curve) or repellent (red dashed curve). Addition of saturating attractant (repellent) is 
modeled by changing instantaneously the CheA active fraction, a, from its adapted value (cf. Eq.fTTI) to 
0(1). Dots indicate half-maximal response. B. Fraction ip of FliM proteins bound to CheY-P versus time 
after a step addition of saturating attractant (blue curve) or repellent (red dashed curve). Dots indicate 
half-maximal response. 


Effect of a concerted increase of protein abundances 

The overall abundances of chemotaxis signaling proteins (Che proteins and chemoreceptors) are variable 
across E. coli strains and growth conditions, but relative proportions are well-conserved dlbt) . Strik¬ 
ingly, these proteins are more highly expressed in minimal medium than in rich medium (E) . When the 
abundances of chemotaxis signaling proteins were varied in a concerted fashion (fisl) . the chemotactic 
efficiency of cells (measured by a swarm assay) was found to increase sharply up to wild-type abun¬ 
dance, and then to continue increasing much more gradually while progressively leveling off. Here, to 
mimic the experiment of Ref. E), we vary the abundances of CheA, CheY, CheZ, CheB and CheR, 
while keeping their proportions and the FliM abundance fixed, as in Table [S2l (We checked that varying 
the abundance of FliM in a concerted fashion with the rest does not affect our conclusions.) Solving our 
pathway model Eqs.fTlfTTlin the adapted steady state, we find that when protein abundances are increased, 
[CheY-P] and ip both increase sharply up to about reference abundances, and the increase then progres¬ 
sively levels off (Pig. UK). Our reference abundances (one-fold in Pig.lH) correspond to those measured 
in Ref. dlbh for strain RP437 in rich medium (see Models and methods and Table [S2l). 

The effect of a concerted variation of protein abundances on [^eY-P] was previously modeled in 
Ref. IlS). Our results (Pig. |4]4) are mostly consistent with Ref. (1181) . but using one adaptation model 
Ref. 


IlSh obtained a maximum in [CheY-P] versus fold abundance. In our framework too, modifying 
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details of the adaptation model (Eq. fTTI) ean result in sueh a maximum, but above one-fold expression 
(for realistie parameter values), so our main eonelusions are not affeeted. In Ref. (15), eloekwise bias was 
found to be monotonie versus eoneerted fold expression, whieh is eonsistent with our results (Fig.|4]4). 
Building on a similar framework to Ref. (1181) . we inelude CheZ saturation by CheY-P, whieh has now 
been measured (12 iL l26h . and we diseuss FliM occupaney ip and gain, and provide analytical insight for 
simple regimes. 

For reference abundances and higher, the steady-state phosphorylated fraction of CheA is very small, 
because of the rapidity of phosphotransfer from CheA-P to CheY Jd^ll . In this “fast phosphotransfer 
regime”, it is possible to solve analytically a simplified version of the pathway (see Supporting Mate¬ 
rial): if the auto-phosphorylation rate of CheA is less than the maximal dephosphorylation rate of CheY-P 
by CheZ, i.e. if 


a < 


fcfJCheZ] 


tot 


uA 


[CheA] 


(15) 


tot 


then 


[CheY-P] 


[CheZ],ot _ ^ ’ 

[CheAjto, 


(16) 


and Ip is given by Eq. lS20[ These expressions only depend on abundance ratios, on kinetic rate constants, 
and on a, which converges to a constant value at high abundances (see Supporting Material). Hence, 
in the high-abundance limit, these steady-state values of [CheY-P] and ip, which arise from the equili¬ 
bration of phosphorylation and dephosphorylation of CheY, converge to plateaus invariant to concerted 
variations of the overall abundances. Conversely, if the condition in Eq.[T5]is violated, CheZ is saturated, 
and [CheY-P] increases with overall abundances. The conditions for the fast phosphotransfer regime are 
satisfied with the standard abundances used here and with higher overall abundances (see Supporting 
Material). The plateaus of [CheY-P] (Eq.fT^ and ip (Eq. lS20l) are indicated by thin lines in Fig.|^. 

In the opposite limit of low abundances, two-molecule encounters become unlikely, including the 
binding of CheY to CheA-P, so the phosphorylated fraction of CheA becomes high, and only a small frac¬ 
tion of CheZ and of FliM are bound to CheY-P. Using the simplified pathway model presented in the Sup¬ 
porting Material, we show that if [CheY]tot <C [CheZ]tot <C k^^Jk^, 

then 


[CheY-P] 


[CheY]tot 


k.z ichpy].. 




(17) 


Hence, in the low-abundance limit, [CheY-P] grows in proportion with the overall abundances of the Che 
proteins. The same is true for ip (Eq. lS29l) . The low-abundance asymptotes Eqs. [T7] and IS29I are plotted 
as thin dotted lines in Fig. |4]4. 

Our pathway model also yields the gain (7cheY-p of the pathway at the level of the response regula¬ 
tor. This gain grows with overall abundance of chemotaxis proteins, and plateaus in the high-abundance 
limit (Fig. |4^). The corresponding asymptotic value can be determined analytically within the simpli¬ 
fied pathway model in the fast-phosphotransfer regime: Gchev-p (Efi- [I2l) can be obtained from Eq. (T^ It 
yields 


OcheY-P — 


[CheZ] tot 
[CheA] tot 

[CheZ]tot _ ^ ’ 

[CheAjtot 


(18) 


which becomes independent of overall abundances as a converges to its high-abundance limit. Besides, 
in this regime, it can be shown that (7cheY-p = [CheZ] tot/[CheZ] (see Supporting Material). Thus, the 
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Figure 4: Effects of fold-change of expression of all chemotaxis signaling proteins (as in Ref. (fish t. 
obtained from the pathway model in the adapted state (Eqs. [p|TTI). One-fold expression corresponds to 
the abundances in Table [S^ i.e. to those measured in Ref. dlbh for strain RP437 in rich medium. In 
the same rich medium, the fold expression for strain OWl is about 0.22, while in minimal medium, 
the fold expression is about 1.1 for strain RP437 and 2.0 for strain OWl dlbh (values averaged over all 
chemotaxis signaling proteins). A. Adapted free CheY-P concentration ([CheY-P]) versus fold expres¬ 
sion of the chemotaxis proteins. Inset: adapted fraction ip of EliM proteins bound to CheY-P versus fold 
expression. Thin horizontal lines: analytical high-abundance limit in the fast-phosphotransfer regime 
('Eqs.[T^and lS20l) . Thin dotted lines: analytical low-abundance limit (Eqs.fTTland IS291) . B. Correspond¬ 
ing gain in the linear-response regime. Blue curve: gain for CheY-P, Gchev-p (Eq. fT^ . Red (dashed) 
curve: gain for ip, (Eq. [T3]) . Thin horizontal lines: analytical high-abundance asymptotic gains in 
the fast-phosphotransfer regime (Eqs. [T8] and IS24I) . Dotted curve: ratio of total CheZ concentration 
to free CheZ concentration, [CheZ] tot/[CheZ]; in the simplified-pathway fast-phosphotransfer regime, 
GcheY-p = [CheZ] tot/[CheZ] (see Supporting Material). 


gain in [CheY-P] arises from the saturation of the phosphatase CheZ by CheY-P: increasing the active 
fraction a of CheA increases phosphotransfer to CheY, and hence [CheY-P], but this increase is larger 
than that of a because, at the same time, CheZ becomes more saturated, reducing the rate of dephospho¬ 
rylation of CheY-P (see also Ref. d47h). In Pig.|4^, the thin horizontal blue line represents the plateau for 
CcheY-p (Eq.[T8]), and the dotted curve shows [CheZ]tot/[CheZ]: at sufficiently high abundances, it closely 
approximates the gain derived from numerical solution of the full pathway. Similarly, (Eq.[T3]) can be 
determined analytically within the simplified pathway model in the fast-phosphotransfer regime (Eq. lS24[ 
thin horizontal red line in Pig. H^). 

We conclude that the gain in CheY-P increases with overall abundances, up to about reference levels. 
Moreover, chemotactic ^naling is robust with respect to concerted overexpression of the chemotaxis 
proteins (see also Ref. (1151)). as [CheY-P] remains lower than = 3.5 /xM, so that ip < 0.5 remains 
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Abundances of chemotaxis proteins 


in the functional range, below the threshold value (about 0.57) above which the motor only rotates 
clockwise 0). 


Effect of separately varying the concentration of each protein in the pathway 

To study the effect of varying the abundance of each protein separately on the adapted steady state of the 
pathway, we separately varied CheY, FliM, CheA, CheZ, CheR, or CheB abundances, while keeping the 
abundances of all others fixed (values in Table [S2l). Specifically, we calculated the gains Gchev-p and 
(Fig. [5]), as well as the fraction ^jJ of FliM molecules bound to CheY-P (Fig. [S2l) and the concentration 
[CheY-P] of free CheY-P (Fig. 


The effect on [CheY-P] of protein abundance variations was investigated in Ref. dlSl) . In addition to 
the differences mentioned above, this previous study did not include FliM, but included CheW and recep¬ 
tors. Our results (Fig. [S3]) are consistent with those of Ref. (Il8h for abundance variations of CheY, CheZ, 
CheR, and CheB. However, Ref. (ilSh obtained a weak maximum of [CheY-P] upon CheA abundance 
variation, arising from their model of CheA interactions with CheW and receptors. We focus on gain, 
and on the stability of the pathway to FliM abundance variation, which were not included in Ref. (18). 





Figure 5: Effect of fold-change of expression of each chemotaxis signaling protein separately, obtained 
from the pathway model in the adapted state tEgsdllfTT]). One-fold expression corresponds to the abun¬ 
dances in Table [S2l i.e. to those measured in Ref. (ilbh for strain RP437 in rich medium, as in Eig.jH A-F 
(linear-log plots). Blue curves: gain for CheY-P, Gchev-p (Eq.[l2l), versus fold expression of each protein, 
keeping all others at their one-fold level. Red dashed curves: gain for -0, (Eq. [T3]) . In the shaded 
zones, -0 is either smaller than 0.11 or larger than 0.57 (Eig. lS^ . in which case the flag ellar motor should 
rotate only counterclockwise or only clockwise, respectively, in the adapted state (|2l|). 


Eig.[5]shows that the gain of the chemotaxis pathway is robust to moderate individual variations of the 
abundances of each protein. Variations of G^ are even weaker than those of Gchev-p, due to the mitigating 
effect of EliM saturation by CheY-P. 
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Fig. [SK shows that Gchev-p increases with CheY abundanee. Indeed, inereased CheY abundanee (at 
eonstant total CheB and CheA levels) results in less CheB phosphorylation, due to eompetition for CheA- 
P. Redueed [CheB-P] in turn results in an inerease of the adapted a (Eq. [TTI) . and henee of [CheY-P] and 
-0 (Fig.ISlK). Higher [CheY-P] (at eonstant total CheZ level) means that CheZ beeomes more saturated, 
inereasing Gchev-p (see above). Consistently, in the fast phosphotransfer regime, Eq.[T8]shows that Gchev-p 
is an increasing funetion of a, which itself increases with CheY abundanee, for the above-mentioned 
reasons. 

Eig. [5^ shows that the gains are almost independent of the abundanee of EliM, and Eig. [S2b shows 
that the same is true for -0. In addition, solving our simplified pathway model in the fast phosphotransfer 
regime gives expressions for [CheY-P], for -0, and for the gains that are entirely independent of EliM 
abundanees (Eqs.[T^and lS20[[T^ and [S24l) . This robustness of the pathway to EliM abundanee variation 
arises from the faet that, in eontrast to the free CheY-P moleeules, the ones that are bound to EliM ean- 
not be dephosphorylated by CheZ (they ean auto-dephosphorylate, but this proeess is mueh slower than 
dephosphorylation by CheZ). This is analogous to the ease of transeription faetors studied in Ref. 
if transcription factors (or in our case CheY-P) can be degraded (or in our ease dephosphorylated) only 
when they are not bound to their DNA targets (bound to EliM), then the eoneentration of non-bound 
transeription factors is independent of the number of DNA targets (EliM moleeules). 

In Eig.[5]C, GcheY-p features a weak maximum at about two-fold abundanee of CheA. Inereasing CheA 
abundance raises the level of phosphorylation of CheY, which yields an increase of -0 (Eig. [S2C). and 
moreover inereases saturation of CheZ, whieh inereases Gchev-p- However, once the CheA abundanee is 
so high that almost all CheY is phosphorylated and almost all CheZ is saturated, inereasing a primarily 
increases [CheA-P] and not [CheY-P]: henee, in this regime, the gain deereases with CheA abundanee. 
The maximum in Gchev-p is smoothed out in due to EliM saturation (Eig.[5]C). 

Inereasing CheZ abundanee has the opposite effeet of inereasing CheA abundance, sinee these two 
enzymes have an antagonistic role in the pathway. Aeeordingly, Gchev-p features a weak maximum 
at about 0.5-fold abundanee of CheZ (Eig. OD), with xjj decreasing when CheZ abundanee inereases 
(Eig. [S2t)). Inereasing CheR abundance yields an increase of the aetive fraction a of CheA (Eq. fTTI) . 
Henee, it is effeetively similar to inereasing CheA abundanee (Eigs. [5f! and [S2£). Finally, inereasing 
CheB abundanee has the opposite effect, i.e. a similar effect to increasing CheZ abundance (Figs. [5p 
and[S2p). 


Discussion 


Fast signaling requirements impose strong constraints on the chemotaxis pathway 

The ehemotaxis pathway is a member of the family of two-eomponent signaling systems that enable 
baeteria to sense and respond to various features of their environment. This pathway is widely studied 
as a model signaling system. However, it faees speeific constraints. Chemotaxis regulates cell swimming 
with response times of a fraction of a second. Eonger response timeseales would direetly increase the lag 
between detection of a chemoeffector concentration and ehange in motion, with potentially deleterious 
consequenees in extreme environments (e.g. in steep repellent gradients), but also in fast-changing ones. 
The latter ease eould be particularly important evolutionarily as motility peaks at the entry into stationary 
phase, when bacteria are eompeting for searee resourees (l49h . In eontrast, the output of most other two- 


eomponent systems lies in transeriptional regulation (l9Lllll.ll2h. These systems feature overall in vivo 
response times of minutes to hours ds^ . and their signaling involves phosphorylation reaetions with in 
vitro timeseales of minutes (B. 
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We have shown that the requirements of fast signaling impose lower bounds on the dissoeiation eon- 
stant between CheY-P and FliM, and henee on CheY-P abundanee, as well as on CheZ, and CheA 
abundanees. These lower bounds are satisfied by experimental values, giving the right order of magni¬ 
tude for CheY-P, and CheZ, and indieating a low aetive fraetion a of CheA in the adapted state. In 
practice, our pathway model gives a = 0.25 in the adapted state, within the range of previous estimates, 
which vary from a few percent (21.44) to about 30% (3.45). Note that similar constraints might exist on 
the abundance of CheR and CheB, since they control the dynamics of adaptation (0. However, rapidity 
constraints are less obvious on the adaptation timescales than on the fast response timescales studied 
here. 

Several other features of the chemotaxis pathway reflect a pressure towards rapidity. First, the 
existence of a dedicated phosphatase for the response regulator CheY, which is uncommon for two- 
component systems, suggests the importance of fast turnover of the CheY-P pool. Second, CheA is 
an extremely fast histidine kinase: when incorporated in signaling complexes containing chemorecep- 
tors and CheW, the autocatalytic rate of CheA is = 20 /s for E. coli (12-3) and Salmonella 
typhimurium (l29t) . which makes it four to five orders of magnitude faster than other kinases in two- 
component systems (Table [S3]). Another possible signature of the pressure towards rapidity is that the 
response timescale of the chemotaxis pathway is only slightly larger than the diffusion time of CheY-P 
across the cytoplasm (Fig. [2^), estimated using measured diffusion coefficients (13. 13) and a character¬ 
istic cell size of ~ 1 pm. Hence, the response of the chemotaxis pathway is almost as fast as it can be. 
Note that our model, which focuses on average concentrations, should slightly underestimate response 
timescales due to the neglect of diffusion. Thus, a full spatial model (jSSj) should yield slightly more 
stringent lower bounds on protein abundances. 

One can wonder why the adapted CheA active fraction a is low (< 30%) while CheA is pushed 
towards extremely high rapidity of autophosphorylation. Ref. (l44l) shows that a low a makes the dynam¬ 
ics of the pathway response robust to slowly varying multiplicative noise. The pathway output is assumed 
to be proportional to a, with the proportionality factor fluctuating, but more slowly than the response 
timescales of the pathway. In Ref. (l44h . the output is chosen to be the fraction of CheZ bound to CheY-P, 
which is measurable by FRET. The noisy proportionality factor then involves the ratio of total CheA 
abundance to total CheZ abundance (see Eq. ISlSI) . The robustness of the dynamics to such multiplicative 
noise arises from the fact that at low a, the signal amplification at the receptor level is exponen¬ 
tial, via the Boltzmann factor for CheA to be in its active state (l44h . Since rapidity constraints imply 
a[CheA]tot > 0.25 pM, requiring in addition an adapted a < 30% implies [CheAjtot > 0.83 /iM, which 
is only ~ 3 — 4 times lower than the experimental value. Note that this would also entail a lower bound 
on receptor concentration of 5.0 /rM, given the stoichiometry of the array. 

Since the requirement of fast signaling calls for high abundances of chemotaxis proteins, it follows 
that these protein levels should be higher than in homologous systems with different outputs. Many 
bacteria with chemotaxis pathways similar to that of E. coli (13) . and for which similar timescales are 
expected dSVj), possess multiple gene clusters encoding Che proteins. Some of these paralogs regulate 
twitching motility based on type IV pili, while others are involved in very different cellular functions, 
such as development, biofilm formation, cell morphology, cell-cell interactions, and flagellar biosyn¬ 


thesis (15611581) . In the Supporting Material, we compare expression of the Che proteins involved in 


chemotaxis to the expression of those from paralog clusters, in five different bacteria {Pseudomonas 
aeruginosa. Vibrio cholerae, Caulobacter crescentus, Sinorhizobium meliloti, Rhodobacter sphaeroides), 
using data from published microarray studies. We find that homologous non-chemotactic genes are 
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significantly less expressed (at the mRNA level) than the ones aetually involved in ehemotaxis (Tables [S4l- 
[S8]) . with the exception of the CheY involved in twitching motility in P. aeruginosa, which might also be 
subjeet to rapidity eonstraints (see Supporting Material). 

It is also interesting to eompare the eellular abundanees of CheA and CheY to those of the histidine 
kinases (HKs) and response regulators (RRs) in other two-component signaling systems. Table [^pro¬ 
vides sueh a eomparison for E. coli. The protein abundance data come from several published studies and 
show significant variability, which may be explained by differenees in media, growth phases, strains, and 
techniques, and the comparison should thus be taken with caution. However, it appears that CheA pro¬ 
teins are orders of magnitude more highly expressed than all the other HKs for whieh data are available 
(Table [S9l). The eomparison is less striking for CheY, since it does not appear to be partieularly highly 
expressed among RRs in the data from Ref. (l59h . but the protein abundance measured in Ref. (ilbh is 
mueh higher, and would plaee CheY among the most highly expressed RRs (Table [S9l). While in E. coli, 
CheA and CheY are expressed at eomparable levels rflbh . a number of other RRs are one or two orders 
of magnitude more highly expressed than their eognate HKs (Table [S9l). In two-component systems with 
bifunctional HKs that also dephosphorylate their cognate RRs, high RR abundanees enable the level of 
phosphorylated RR to be insensitive to variations in the HK and RR abundances (IbOl. lb It) . The E. coli 
ehemotaxis pathway is different sinee it possesses a dedicated phosphatase, CheZ. However, the eondi- 
tion for obtaining a plateau of [CheY-P] at high abundanees (Eq.[T5]) and the corresponding adapted value 
of [CheY-P] (Eq.fT^ both depend on the ratio of CheZ to CheA abundanees, whieh may liuetuate. Addi¬ 
tional meehanisms provide robustness with respect to this ratio. Eirst, CheZ is aetivated upon interaction 
with CheA-short (l23l.l24ll. and most phosphatase activity takes plaee at the receptor arrays (l25h . whieh 
keeps phosphatase aetivity eoupled to kinase abundanee. Seeond, CheZ oligomerizes in the presence of 
CheY-P (|62|) . and this increases its aetivity (l63h . Einally, the dependenee of a on [CheB-P] (Eq. [TTI) . 
together with the eompetition between CheB and CheY for CheA-P, are thought to eouple kinase and 
phosphatase aetivities sinee CheZ and CheY abundanees are strongly eoupled (|44) . 

Henee, high ehemotaxis protein abundanees appear to arise from the speeific rapidity eonstraints 
on the ehemotaxis pathway. Supporting this view, we note that ehemotaxis protein abundances simi¬ 
lar to those in E.^li are found in the Gram positive baeterium Bacillus subtilis, whieh has even more 
chemoreceptors 


Self-assembly requirements yield additional constraints 

Apart from the eonstraint of fast signaling, the ehemotaxis pathway is also unusual among two- 
eomponent systems in that it involves two types of large self-assembled multiprotein eomplexes: the 


ehemoreeeptor arrays, whieh allow for signal amplification via cooperativity ([SL [30, [45L [65[) . and 
the rotary flagellar motors, whieh enable the eell to swim. The self-assembly requirements of these 
complexes also eontribute constraints on the abundances of ehemotaxis proteins. 

Eirst, inclusion in receptor arrays increases the autoeatalytic rate of CheA by two orders of mag¬ 
nitude ([29[) , so only CheA in arrays is functionally relevant (see Models and methods, and Table [S2l). 
However, overall cellular proportions reveal a significant excess of CheA with respect to the preeise 
6:1:1 reeeptor:CheA:CheW stoichiometry of the receptor arrays (0); for instanee, overall proportions 
are 2.2:1:1 for strain RP437 in rieh medium ([13). Ref. (B shows that overexpressing reeeptors up to 
~ 7-fold wild-type level at native CheA level leads to a stronger response to repellent, i.e. to a stronger 
kinase activity, which shows that CheA is strongly in excess in these conditions too. Besides, in vitro 
assembly of receptors alone leads to the formation of non-funetional struetures, ealled zippers, while 
adding CheA and CheW in exeess to stoiehiometrie array proportions yields arrays (|3). Henee, in E. coli. 
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the correct self-assembly of the receptor arrays seems to require an excess of CheA. Note however that 
overall cellular proportions appear to be different in B. subtilis, but this bacterium also expresses soluble 
(non-transmembrane) chemoreceptors 

Second, self-assembly of the flagellar motor appears to constrain the abundance of the protein FliM. 
In the motor, FliM forms a ring of ~ 32 subunits (2^ which bind CheY-P to mediate switching of the 


direction of motor rotation. Studies (12 iL l27Ll28L l3lh reveal that only a small fraction of FliM (< 30%), 
is part of complete motors (see Supporting Material, esp. Table IS lOl) . Nevertheless, underexpression and 
overexpression experiments indicate that FliM constitutes a limiting resource for proper motor assem¬ 
bly 1271) ■ Consistent with this observation, more than 25% of FliM is found inpartially assembled 
structures (1281. l3lb. with only about 16% of FliM copies free in the cytoplasm (13 lb . Since it is likely 
that FliM in partially assembled structures binds CheY-P with an affinity comparable to FliM in com¬ 
plete motors, these additional FliM contribute to the lower bound on the total cellular concentration of 
CheY-P, CcheY-p = [CheY-P] -f [FliM • CheY-P], yielding the second term, which accounts for 23% of 
the total CheY-P lower bound. Hence, motor self-assembly requirements on FliM abundance provide a 
separate lower bound on CheY-P abundance only a factor of ~ 4 lower than our complete lower bound, 
which involves the actual value of . 

Since FliM is in excess of the requirement for complete motors, one can ask if the FliM level is 
constrained by signaling requirements. However, our study demonstrates that the gain, as well as the out¬ 
put of the pathway, are very robust to variations of the abundance of FliM (Fig. [5^). The gene encoding 
FliM does not belong to either of the meche and mocha operons that encode the Che proteins (I^ 15, 6^ . 
Hence, FliM expression levels are likely to feature non-negligible abundance fluctuations with respect to 
other proteins in the pathway, making robustness to FliM abundance variations a useful feature. 


Gain and cooperativity are increased by a concerted increase of protein abundances 

The abundances of chemotaxis proteins were measured in two different E. coli strains considered wild- 
type for chemotaxis, in both rich and minimal growth media, in Ref. dlfih . Strikingly, chemotaxis proteins 
tend to be more expressed in minimal medium than in rich medium. While this increase is modest for the 
reference strain RP437, where Che protein abundances increase from 1 fold to 1.1 fold, it is very strong 
for strain OWl, where Che protein abundances increase 9.4 times, from 0.22 fold to 2.0 fold. 

Proportions are well-conserved despite this high variability of abundances (llbh . The Che proteins are 
expressed from two adjacent operons in the E. coli genome, the meche operon, which encodes CheR, 
CheB, CheY, CheZ, as well as two types of chemoreceptors, and the mocha operon, which encodes 
CheA and CheW 113, [H 0. Both meche and mocha operons are in the same regulon: they are under 
transcriptional control of the sigma factor and of the anti-sigma factor FlgM ({l5I). In addition to this 
transcriptional coupling, these genes also feature translational coupling (jb^)- This enables the expression 
levels of the Che proteins to be correlated and their proportions to be stable (01 Til) . In Ref. (flih . where 
the abundances of chemotaxis proteins were varied in a concerted fashion by modulating the expression 
of FlgM, the chemotactic efficiency of cells (measured by a swarm assay) was found to increase sharply 
up to about wild-type abundance, and then to keep increasing much more gradually while progressively 
leveling off. 

We find that the gain at the level of the response regulator CheY-P increases substantially for con¬ 
certed increases of the abundances up to about reference levels, and more moderately above reference 
levels, reaching a plateau in the high-abundance limit. This dependence of the gain on protein abun¬ 
dances (Fig. 11^) is consistent with the swarm assay results of Ref. (flsl) . Gain is a crucial quantity since 
drift velocity in a shallow chemoeffector gradient is proportional to gain dTTl) . Moreover, an increase of 
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the gain could further sensitize cells to small changes of attractant concentration (l45h . which may be 
beneficial in poor media. This effect is strong for strain OWl, where we find that the gain in [CheY-P] is 
increased by a factor ~ 5 in minimal medium vs. rich medium (even more if the small variations in abun¬ 
dance ratios are accounted for). The small gain obtained for 0.22-fold abundance, corresponding to 
the expression level for strain OWl in rich medium (Fig.|4^) arises from the small non-phosphorylated 
CheY reserve in this case (only ~ 5% of total CheY in adapted conditions), which entails a small response 
to an increase of a. Note in addition that CheY and CheZ abundances in strain OW1 in minimal medium 
are smaller than our lower bounds derived from rapidity constraints, indicating slower response times. 

In addition to the increase of gain, receptor overexpression has been shown to increase coopera- 
tivity among receptors by increasing the size of receptor signaling teams (0 S [^- This additional 
cooperativity can also increase sensitivity to low attractant concentrations. Together, these increases 
of sensitivity help explain why the proteins of the chemotaxis pathway are overexpressed in minimal 
medium compared to rich medium dlbh . despite the cost of additional protein expression. 
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1 Chemotaxis pathway model: chemical reactions and parameter values 

The chemical reactions corresponding to our E. coli chemotaxis pathway model in Eqs.fTlfT^of the main 
text are: 


CheA CheA-P, 

(SI) 

CheA-P + CheY % CheA + CheY-P, 

(S2) 

CheA-P -f CheB % CheA + CheB-P, 

(S3) 

CheZ + CheY-P ^ CheZ • CheY-P CheY + CheZ, 

(S4) 

CheY-P % CheY, 

(S5) 

FliM • CheY-P % FliM + CheY , 

(S6) 

kM 

FliM + CheY-P ^ FliM • CheY-P , 

uM 

(S7) 

CheB-P % CheB. 

(S8) 


As in the main text, phosphorylated species are denoted by “-P”, and complexes by a dot between the 
two species names (e.g., FliM ■ CheY-P). Eq. [ST] corresponds to autophosphorylation of the histidine 
kinase CheA, and the fraction a of active CheA is accounted for by an effective reduction of the auto- 
catalytic rate from to Eq. [S^and Eq. |S3] represent phosphotransfer from CheA-P to CheY 

and CheB, respectively. Eq. |S4l expresses dephosphorylation of the phosphorylated response regulator 
CheY-P by the phosphatase CheZ, and Eq. the (much slower) auto-dephosphorylation of CheY-P. 
Similarly, Eq. [^corresponds to auto-dephosphorylation of the FliM • CheY-P complex, and Eq. ISSlto 
auto-dephosphorylation of CheB-P. Finally, Eq. [^represents the binding of the phosphorylated response 
regulator CheY-P to the FliM protein, which is a part of the flagellar motor, as well as their unbinding. 

The values of the rate constants used are presented in Table [STJ and the values of the effective total 
cellular concentrations are presented in Table [S^ These concentration values are used as references when 
abundances are varied in model calculations. 
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1 CHEMOTAXIS PATHWAY MODEL: CHEMICAL REACTIONS AND PARAMETER VALUES 


Table SI: Values of the rate constants of the E. coli signaling pathway used in our 
model. Note that a comprehensive list of experimental values is available online at 
‘http://www.pdn.cam.ac.uk/groups/comp-cell/Data.htnil’. fcf and were adjusted to yield consistency 
with FRET data (see Models and methods). 


Constant 

Value 

Notes and references 

~1^ 

^cat 

20 /s 

C^-M) 

kl 

40 /s//iM 



15 /s//iM 



2.3 /s//xM 

Adjusted; 5.6 /s//iM in the absence of CheA (221. 

kl 

0.04 /s 

in the absence of CheA (221 

UZ 

^cat 

12.3 /s 

Adjusted; 4.9 /s in the absence of CheA (221. 


0.04 /s 

OO-IZ) 

e 

5 /s//iM 

Diffusion-limited (21. 421. 

uM 

18/s 

Erom = 3.5 /xM (21, 33-3^ and . 

UB 

0.37 /s 

(73,7^ 

uR 

^cat 

0.12/s 

Eor S. typhimurium CheR (751. 

uB 

^cat 

0.29 /s 

(m 


Table S2: Values of the effective total cellular concentrations of the chemotaxis proteins used in our 
pathway model (Eqs. fpfTTT) . These values derive from the total numbers of each chemotaxis protein per 
cell measured in Ref. (ilbl) for strain RP437 in rich medium for all proteins but EliM, and from those in 
Refs. (I 27 L l28h for EliM, also in rich medium. Eor CheA and EliM, we take into account additional con¬ 


straints imposed by the assembly of chemoreceptor arrays and flagellar motors, respectively^ explained 
in Models and methods in the main text. We use the standard E. coli cell volume of 1.4 fL (Il5l.l2lb. 


Protein 

Total concentration (/xM) 

Notes and references 

CheA 

2.97 

1/6 of the chemoreceptor concentration, 17.8 /xM (161. 

CheY 

9.73 

a© 

CheZ 

3.80 

(M 

CheB 

0.28 

m 

CheR 

0.17 

m 

EliM 

1.43 

(21. 281. The 16% of EliM that are free (311 are discounted. 
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2 Simplified pathway model 

2.1 Assumptions and model 

The full pathway model, eorresponding to the ehemieal reaetions in Eqs. ISTHS^ is written in Eqs. [U- 
[TO] in the main text. Here, we present an analytieally traetable simplified version for steady state. Our 
simplifying assumptions are the following: 

• With regard to CheA-P levels, we negleet phosphotransfer to CheB with respect to phosphotransfer 
to CheY, i.e., we assume fc^[CheY] fcf [CheB] (see Eq. |^. Indeed, the total concentration of 
CheY is much larger than that of CheB (Table IS^. and in addition kl > (Table ISTT). More¬ 
over, CheY-P is dephosphorylated much faster than CheB-P, due to the existence of the dedicated 
phosphatase CheZ, so its turnover is much faster. 

• We treat the active fraction a of CheA as a parameter, without explicitly relating it to the CheB-P 
concentration (e.g., as in Eq. [TTI) . Thanks to this simplification, and to the previous one, CheB 
decouples from the rest of the system, and can thus be ignored. 

• We neglect auto-dephosphorylation of CheY-P, as it is much slower than dephosphorylation by 
CheZ. 

• We neglect auto-dephosphorylation of CheY-P in the complex EliM • CheY-P, as it is much slower 
than dissociation of this complex <C kf, see Table ISTl). 

• We neglect dissociation in the complex CheZ-CheY-P, as it is much slower than dephosphorylation 

(fcf Table [SB- 

Under these assumptions, at steady state (i.e., when all time derivatives vanish), the pathway model in 


Eqs. [BdO] becomes: 

[CheA]tot = [CheA] + [CheA-P], (S9) 

[CheY],ot = [CheY] + [CheY-P] + [EliM • CheY-P] + [CheZ ■ CheY-P], (SIO) 

[CheZ] tot = [CheZ] + [CheZ • CheY-P], (S11) 

[EliM]tot = [EliM] + [EliM • CheY-P], (SI2) 

ak^jCheA] = k^ [CheY] [CheA-P], (S13) 

kg^ [CheZ • CheY-P] = k^ [CheY] [CheA-P], (S14) 

fcf,t[CheZ • CheY-P] = fcf [CheZ] [CheY-P], (S15) 

fcf [CheY-P] [EliM] = kf [EliM • CheY-P]. (S16) 


In this system, Eqs. IS 13IIS151 simply express the equality of the phosphorylation and dephosphorylation 
speeds of CheY at steady state. 

2.2 Fast phosphotransfer limit 

Given the rapidity of phosphotransfer from CheA-P to CheY at standard cellular concentrations (Ii3),a 
relevant limit is the “fast phosphotransfer limit”, where CheA-P very rapidly undergoes phosphotransfer. 
In this limit. 


[CheA] ^ [CheA]tot > [CheA-P]. 


(S17) 
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2 SIMPLIFIED PATHWAY MODEL 


The simplified system Eqs. [S91IS16I ean be solved analytieally in the fast phosphotransfer limit, 
yielding sueeessively: 


[CheZ • CheY-P] 

[CheY-P] 

^ _ [FliM • CheY-P] 

[FliMjtot 


UA 

a ^ [Che A] tot, 

^cat 


a 


juZ 


[CheZ],c 

[CheA]t, 


— a 


uA 

^cat 


a 




(SI 8) 

(519) 

(520) 


One neeessary eondition for the fast phosphotransfer limit to apply is that [CheZ ■ CheY-P] obtained under 
it (see Fq. ISlSI) should be smaller than [CheZ]tot. This gives the following eondition on the aetive fraetion 
a of Che A: 


fcf,jCheZ]tot 

fco4[CheA]tot 


(S21) 


In other words, to be in the fast phosphotransfer regime, the veloeity of the autophosphorylation of CheA 
needs to be slower than the maximal veloeity of the dephosphorylation of CheY-P by CheZ. Fqs . 1819118201 
show that, in the fast phosphotransfer limit, both [CheY-P] and '0, whieh ean be eonsidered as the outputs 
of the pathway, depend only on the kinetie rates, on the aetive fraetion a of CheA, and on the ratio of 
the total eoneentrations of CheZ and CheA. Henee, if a is eonstant, both [CheY-P] and -0 are invariant 
to eoneerted variation of the abundanees of all the proteins in the pathway, keeping the abundanee ratios 
constant (as in Ref. (B). 

These results can be used to obtain the gain for the simplified pathway in the fast phosphotransfer 
limit. The gain in [CheY-P], defined in Fq.[T3 can be obtained from Fq. l819l bv differentiating [CheY-P] 
with respect to a, yielding 


l^CheY-P — 


[CheZ] tot 
[CheA] tot 

[CheZ],ot • 

[CheA],ot 


(822) 


Using Fqs. l811l and l818[ we can express this gain as 


GcheY-P — 


[CheZ]tot 

[CheZ] 


(823) 


This expression demonstrates that the gain in [CheY-P] arises from saturation of the phosphatase CheZ. 
8 imilarly, the gain in -0, defined in Eq.[T3l can be obtained from Eq. l820l bv differentiating ip with respect 


to a, yielding 


G„ = 


[CheZ] tot 
[CheA] tot 


[CheZ] tot I 
[CheA]tot “ 




[CheZ],ot 0fcf fcf 
[CheA]tot a ■ 


(824) 


2.3 Comparison with results from the full pathway 

The validity of the fast phosphotransfer limit in Eq . l817l and of the assumptions in our simplified pathway 
model (see above), can be tested against the results from the full pathway. We find that the results agree 
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well at reference expression levels and higher of the chemotaxis signaling proteins, for the parameter 
values used here (kinetic rates in Table and abundance ratios equal to those in Table IS^ . 

In our full pathway model, a is coupled to the rest of the pathway through [CheB-P] (see Eq.[TTl). Solv¬ 
ing the full pathway yields a as a function of the expression level of all chemotaxis signaling proteins 
(Fig. ISTI) . At high expression levels, a reaches a plateau. The asymptotic value of a at high expression 
levels can be calculated within the fast-phosphotransfer limit of our simplified pathway. For this, we 
express [CheB-P] as a function of a, using Fq. |5] and Fq. [TO] at steady state, and use the solutions of 
the simplified pathway in the fast phosphotransfer limit for [CheA-Pj. We then use Fq.fTTl together with 
this expression for [CheB-P] in order to solve for a. In the limit of high abundances (keeping abundance 
ratios constant), this amounts to solving a second-degree equation, which yields the asymptotic value of 
ot. With the parameter values used here (kinetic rates in Table ISTl and abundance ratios equal to those in 
Table IS^. we obtain a = 0.27, close to the value a = 0.26 obtained for 2.5-fold overexpression from 
the full pathway model (Fig. ISTI) . 



Figure S1: Adapted active fraction a of CheA versus fold expression of all chemotaxis signaling proteins, 
obtained from the pathway model in the adapted state fFqs.fTinTI). Thick curve: result from the full path¬ 
way model. Thin line: asymptotic high-abundance result from the simplified pathway model in the fast 
phosphotransfer regime. One-fold expression corresponds to the abundances measured in Ref. (!l6h for 
strain RP437 in rich medium. In the same rich medium, the fold expression for strain OW1 is about 0.22, 
while in minimal medium, the fold expression is about 1.1 for strain RP437 and 2.0 for strain OWl (B 
(values averaged over all the chemotaxis signaling proteins). See also Fig. |4|in the main text. 


Hence, the results from our simplified pathway model in the fast phosphotransfer limit are relevant at 
high abundances, and account for the observed plateaus of [CheY-P] and xjj in Fig.|4j4, and of the gain in 
Fig.jl^. Using the high-abundance asymptotic value a = 0.27, Fq. IS19l vields [CheY-P] = 2.8 /iM, and 
Eq. lS20l yields ip = 0.44. These values are close to those obtained for high overexpression (specifically, 
[CheY-P] = 2.6 /iM, and ip = 0.43 for 2.5-fold overexpression, see Fig. |4]4). Similarly, Fq. IS22I yields 
GcheY-p = 1-5 here, and Eq. lS24l yields = 0.84, extremely close to the values obtained for high over¬ 
expression (Fig.|4^). These asymptotic high-abundance values from our simplified pathway model in the 
fast phosphotransfer limit are plotted as thin lines in Fig.|4j4-B. In addition, the CheZ concentration from 
the full pathway solution is used to plot the ratio in Eq. IS23[ which is the dotted line in Fig. |4^. It gives 
a good approximation to the actual gain in [CheY-P] for sufficiently high abundances, of order one-fold 
and above. 
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2 SIMPLIFIED PATHWAY MODEL 


2.4 Low-abundance limit 

While the fast phosphotransfer regime is the relevant one for standard eellular abundanees and higher, the 
pathway’s behavior is very different in the limit of low abundanees. Indeed, sinee autophosphorylation of 
CheA is autonomous, while phosphotransfer to CheY involves a two-moleeule eneounter between CheA- 
P and CheY, the fraetion of phosphorylated CheA beeomes high in the limit of low overall abundanees. 
More preeisely, if [CheYjtot ^ ak^^Jk^, then Eg. IS 131 ensures that 


[CheA] < [CheA-P] « [CheAjtot. 


(S25) 


Provided that [CheYjtot k^at/^a that [CheYjtot 'C = kf /k^, small proportions of CheZ and 
of FliM are bound to CheY-P. We obtain, using Eqs. lS15l and lS16l 


k^ 

[CheZ • CheY-Pj ^ -^[CheZjtot[CheY-Pj, 

^cat 

(S26) 

kM 

[FliM • CheY-Pj « [FliM]«[CheY-Pj. 

(S27) 


'd 


In this regime, if in addition [CheZjtot -C k^^Jk^, whie implies that a small fraetion of CheY-P is bound 
to CheZ (see Eq. lS26l) . Eqs. lS14l and lS15l vield 


[CheY-Pj 


[CheY] 


tot 


hz rrhp7i.. 


hM , 


(S28) 


Given that we vary the overall abundanees of CheA, CheY, CheZ, CheB, and CheR, while keeping their 
proportions and the FliM abundanee fixed (see main text), Eq. IS28I shows that in the low-abundanee 
limit, [CheY-P] grows in proportion to the overall abundanees of the Che proteins (indeed it shows that 
[CheY-Pj oc [CheYjtot). 

Eqs. lS27l and lS28l vield an expression for -0 in this limit: 




[CheY-Pj 


[CheY] 


tot 


+ [FliMjt 


T^M fcf [CheZjtot 
kX [CheAjtot 


(S29) 


These asymptotie low-abundanee expressions from our simplified pathway model are plotted as thin 
dotted lines in Fig. |4jA.. 

Note that here, we have just studied the low-abundanee limit of our simplified pathway model. In 
praetiee, additional effeets might eome into play, for instanee the formation of the ehemoreeeptor array 
and of the flagellar motor would likely be impaired at too low abundanees. 
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3 Effect of a variation of the ievei of each protein of the pathway 

The effect of varying the level of each protein of the chemotaxis pathway is discussed in the main text, 
and Fig.[5]shows how the gain is affected by these individual variations of protein levels. Here, we present 
results regarding the direct outputs of the pathway, namely the adapted fraction ip of FliM proteins bound 
to CheY-P (Fig.lS^ and the adapted free CheY-P concentration, [CheY-P] (FigJS3]). 

These results, especially Fig. |S3l enable a direct comparison with Ref. (IlSb . where [CheY-P] was 
considered, but not the gain of the pathway. 








Figure S2: Effect of fold-change of expression of each chemotaxis signaling protein separately, obtained 
from the pathway model in the adapted state (Eq s.[IlfTT]) . One-fold expression corresponds to the abun¬ 
dances in Table [S2l i.e. to those measured in Ref. (Ilbh for strain RP437 in rich medium, as in Eig.lH A-F. 
Blue curves: adapted fraction ip of EliM proteins bound to CheY-P versus fold expression of each pro¬ 
tein, keeping all others at their one-fold level. Blue dots: one-fold expression case. In the shaded zones. 
Ip is either smaller than 0.11 or larger than 0.57, in which case the flagellar motor should rotate only 
counterclockwise or only clockwise, respectively, in the adapted state (12 ih . 
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3 EFFECT OF A VARIATION OF THE LEVEL OF EACH PROTEIN OF THE PATHWAY 
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Figure S3: Adapted free CheY-P eoneentration, [CheY-P], versus fold expression of eaeh protein, keep¬ 
ing all others at their one-fold level. Same eonventions as in Fig.[S^ Here too, in the shaded zones, is 
either smaller than 0.11 or larger than 0.57 (see Fig. [S^. 
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4 Comparison of the autocatalytic rates of different histidine kinases 


Table [S3] lists experimental values from the literature for the autoeatalytie rates of different histidine 
kinases involved in baeterial two-eomponent signaling systems. CheA is mueh faster than all of these, 
with an autoeatalytie rate = 2.6 x 10“^ /s in £. coli (l77b and = 0.24 /s in 5. typhimurium n2% 
when isolated, and = 23 /s in 5. typhimurium (|2^ when in eomplex with ehemoreeeptors and CheW. 


Table S3: Autoeatalytie rate fccat for various histidine kinases (from different baeteria). 


Organism 

Funetion 

Histidine kinase 

^cat (/s) 

Ref. 

Enterococcus faecium 

Antibiotie resistanee 

Vans 

2.83 X 10-^ 

0^ 

Bacillus subtilis 

Sporulation 

KinA 

1.90 X 10-3 

(79) 

Bacillus subtilis 

Cold shook response 

DesK 

2.80 X 10-3 

(80) 

Thermotoga maritima 


HpkA 

4.23 X 10-^ 

(81) 

Streptococcus pneumoniae 

Virulenoe, eto. 

WalKSpn (C)-His 

1.40 X 10-3 

(82) 

Streptococcus pneumoniae 

Virulenoe, eto. 

WalKSpn (N)-Sumo 

3.60 X 10-3 

(82) 

Escherichia coli 

Response to nitrite 

NarX 

5.00 X 10-3 

(83) 

Escherichia coli 

Response to nitrate 

NarQ 

2.20 X 10-^ 

(83) 

Synechocystis 

Light signaling system 

Cphl holo 

2.00 X 10-^ 

(M) 

Synechocystis 

Light signaling system 

Cphl apo 

3.00 X 10-^ 

CM) 

Myxococcus xanthus 

Aggregation; sporulation 

RodK 

1.67 X 10-^ 

(8^ 
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5 EXPRESSION LEVELS OF VARIOUS PARALOGS OF THE CHEMOTAXIS GENE CLUSTERS 


5 Expression levels of various paralogs of the chemotaxis gene clusters 

Here, we eompare the expression levels of che genes aetually involved in ehemotaxis to those of 
non-ehemotaetie che genes in baeteria that have multiple che gene elusters in their genome. For this 
eomparison, we use data from published mieroarray studies. 


5.1 Pseudomonas aeruginosa 

The genome of P. aeruginosa ineludes four main elusters of che genes Among these, two are 
involved in ehemotaxis, with one, PA1456-1464 (+ PA3348-3349 eontaining cheR), being essential for 
ehemotaxis, and the seeond one (PAOl 80-0173) being non-essential. Ref. (l86h showed that strains deleted 
for the latter eluster exhibit positive ehemotaetie response (to peptone and phosphate). In Ref. (l87h . this 
eluster was found to be required for an optimal ehemotaetie response (in addition to the main eluster), 
but the authors state that they eannot exelude that its major output is some other funetion. Amon g th e 
two remaining elusters, one (PA0408-0415) is involved in twitehing motility based on type TV pili (881. 
and the last one (PA3708-3702) regulates biofilm formation through modulation of e-diGMP levels (l89t) . 

In Ref. d^, a mieroarray analysis of P. aeruginosa was eondueted to study its response to hydro¬ 
gen peroxide. The full mieroarray data are available both in the absenee (eontrol) and in the presenee 
of hydrogen peroxide. Table IS4l eorresponds to the mieroarray results (i.e., the abundanee of mRNA) in 
the eontrol data, for cheY, cheA, cheW, cheR, and cheB from all ehemotaxis elusters. It gives the ratio of 
the expression levels of the genes from the main ehemotaxis eluster to those of the eorresponding genes 
from eaeh other eluster. 


Table S4: Mieroarray expression data for P. aeruginosa, from eontrols in Ref. (l90h . The numbers given 
are ratios of the expression levels of the genes from eaeh of the three elusters of che genes that are 
non-essential to ehemotaxis, to those of the eorresponding genes from the essential ehemotaxis elus- 
ter, PA1456-1464 (- 1 - PA3348-3349 eontaining cheR). The indieation “(2)” means that two genes of this 
type are present in the eluster eonsidered (two cheW genes exist in the main ehemotaxis eluster, as well 
as in the twitehing-assoeiated one and in the biofilm-assoeiated one, and two cheY genes exist in the 
twitehing-assoeiated eluster). In these eases, the results obtained for eaeh of the gene eopies within the 
eluster eonsidered were summed. 


*Note that the response regulator eneoded by the biofilm-assoeiated eluster, WspR, is elassified as 
non-CheY (56). 



Chemotaxis II 
PAOl 80-0173 

Twitehing 

PA0408-0415 

Biofilm formation 

PA3708-3702 

cheY 

0.22 

5.34 (2) 

0.22* 

cheA 

0.09 

0.93 

0.23 

cheW (2) 

0.02 

0.58 (2) 

0.24 (2) 

cheR 

0.15 

0.52 

0.21 

cheB 

0.02 

0.30 

0.31 


Table [S^ shows that genes from the main ehemotaxis eluster are signifieantly more highly expressed 
than those of the seeond eluster involved in ehemotaxis. They are also more expressed than genes from 
the biofilm-assoeiated one. However, the cheA and cheW genes from the twitehing-assoeiated eluster 
have similar levels of expression as those of the main ehemotaxis eluster, and the two eopies of cheY 
in the twitehing-assoeiated eluster are together five times more expressed than the cheY involved in 


Biophysical Journal 












29 


chemotaxis. This twitching motility-associated che gene cluster is known to modulate the activity of an 
adenylate cyclase involved in virulence, and to have additional roles in transcriptional regulation, but this 
pathway is complex and not fully elucidated yet (15 811 . Chemosensing for directed twitching motility (on 
surfaces) has to be fast as in swimming, and the twitching-motility system is known to mediate chemo¬ 
taxis towards phospholipids dolh . Hence, the abundance of these motility-associated proteins might partly 
arise from rapidity constraints, as in the case of swimming chemotaxis. In addition. Ref. (1881) suggests 
a possible role of (one of) the CheYs of this system as a phosphate sink (as in the chemotaxis systems 
of Rhodobacter sphaeroides, which include no CheZ): this might explain the very high expression of the 
two cheY genes in the P. aeruginosa twitching motility-associated cluster. 

Notably, the main chemotaxis cluster is the only one that includes CheZ, whose role is to 
dephosphorylate CheY rapidly. 


5.2 Vibrio cholerae 


The genome of V. cholerae includes three main clusters of che genes (156h . Among these, only one, 
VC2059-2065 (-i- VC2201-2202 containing cheR) is involved in chemotaxis. The functions of the 
other two clusters (VC1394-1406 and VCA1088-1096) are not known yet. In fact, neither deletion nor 
overexpression of the genes in these clusters has been found to produce any detectable phenotype (1921) . 

In Ref. (1931), the transcriptome of V. cholerae was studied both for bacteria grown in vitro and dur¬ 
ing intraintestinal growth (in the latter case, the bacteria were harvested from rabbit ileal loops). The 
full microarray data are available in both of these conditions. Table IS5f al and (b) gives the ratio of the 
expression levels of the genes from the actual chemotaxis cluster to those of the corresponding genes 
from each other cluster, calculated from the microarray data of Ref. 


Table S5: Similar data as in Table lS4l for V. cholerae. (a) and (b): Microarray expression data from 
controls in Ref. (l93h - (a): harvested from rabbit ileal loops; (b): grown in vitro. 


(a) 

VC 1394-1406 

VCA1088-1096 

cheY 

0.45 (2) 

0.29 

cheA 

0.70 

0.25 

cheW 

0.30 

0.53 (2) 

cheR 

0.63 

0.62 

cheB 

0.14 

0.20 

(b) 

VC 1394-1406 

VCA1088-1096 

cheY 

0.22 (2) 

0.11 

cheA 

0.89 

0.10 

cheW 

0.16 

0.42 (2) 

cheR 

0.33 

0.44 

cheB 

0.14 

0.24 


In spite of some variability between conditions, the data in Table |S5] consistently show that the 
actual chemotaxis cluster is more expressed than the other two che gene clusters. Here too, the actual 
chemotaxis cluster is the only one that includes CheZ, whose role is to dephosphorylate CheY rapidly. 
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5 EXPRESSION LEVELS OF VARIOUS PARALOGS OF THE CHEMOTAXIS GENE CLUSTERS 


5.3 Caulobacter crescentus 


C. crescentus has two different che gene elusters, and only one of them is known to be involved in 
ehemotaxis. Here we eompare the expression levels of the seeond che eluster (CCNA00625-00634) to 
those of the genes involved in ehemotaxis (CCNA00439-00450). Ref. (l94h investigated gene expression 
in different phases of the eell eyele of C. crescentus, and reported a full mieroarray study of C. crescentus 
in these different phases. 


Table S6: Similar data as in Table [S41 for C. crescentus. Mieroarray expression data from Ref. (I94t) . 


Phases of the eell eyele: swarmer (SW), stalk (ST), early predivisional (EPD), predivisional (PD), and 
late predivisional (LPD). 



CCNA00625-00634 

SW 

ST 

EPD 

PD 

EPD 

cheY (3) 

0.24 (3) 

1.17 (3) 

0.10(3) 

0.06 (3) 

0.14(3) 

cheA 

Below deteetion level 

Below deteetion level 

0.03 

0.03 

0.04 

cheW 

0.17 

0.11 

0.06 

0.06 

0.06 

cheR 

0.27 

1.00 

0.13 

0.06 

0.14 

cheB 

0.02 

Below deteetion level 

0.01 

0.02 

0.02 


5.4 Sinorhizobium meliloti 

S. meliloti has two different che gene elusters, and only one of them is known to be involved in ehemo¬ 
taxis. There are also two isolated cheW genes (one being elose to an MCP-eoding gene). Here we 
eompare the expression levels of the seeond ehemotaxis eluster (SMal550-1561) to those of the genes 
of the main eluster involved in ehemotaxis (SMo03004-03012). We do not eonsider the isolated cheW 
gene beeause it is not known whether it is involved in ehemotaxis. Ref. (l95h investigated the infiuenee of 
antimierobial nodule-speeifie eysteine-rieh peptides, and reported full mieroarray study of S. meliloti. 


Table S7: Similar data as in Table [SH for S. meliloti. Mieroarray expression data from eontrols in 
Ref. (l95h (10 or 30 min ineubation with shaking in modified LSM medium, after the addition of ster¬ 
ile water - instead of antimierobial peptide). 

*Note that the response regulator eneoded by the seeond eluster, CheO, is elassified as non-CheY dSbh . 



SMal550-1561 (10 min) 

SMal550-1561 (30 min) 

cheY (2) 

0.52* 

0.78* 

cheA 

0.33 

0.31 

cheW 

0.30 

0.27 

cheR 

0.54 

0.57 

cheB 

0.30 

0.27 


5.5 Rhodobacter sphaeroides 

R. sphaeroides has three different che gene elusters, and two of them are known to be involved in ehemo¬ 
taxis. There is in addition one isolated cheY, whieh is also essential to ehemotaxis. Here we eompare the 
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expression levels of the third che eluster to those of the genes involved in ehemotaxis. Ref. (l96h investi¬ 
gated the role of a protein in the iron metabolism of R. sphaeroides, and reported full mieroarray studies 
of R. sphaeroides in different eonditions. Here we report the results obtained for the wild-type strain 
in the presenee and in the absenee of iron. Similarly, Ref. (l9^ used mieroarrays to investigate the role 
of a light, oxygen, voltage domain protein in blue light-dependent and singlet oxygen-dependent gene 
regulation in R. sphaeroides and reported full mieroarray data. 

Note that these data are to be interpreted with eaution, first because of the complication of R. 
sphaeroides having two essential ehemotaxis systems, and second because the expression level of many 
of the genes considered here were found to be below the threshold of significance defined in the original 
publications. Note also that the non-chemotactic che cluster encodes a putative CheX, and that CheX 
plays the part of a CheY phosphatase in Borrelia burgdorferi 


Table S8: Similar data as in Table [S4l for R. sphaeroides. (a): Mieroarray expression data from controls 
in Ref. (l96l) . (b): Mieroarray expression data from controls in Ref. (l97b . 


(a) 

RSP2433-2443 (WT-i-Fe) 

RSP2433-2443 (WT-Fe) 

cheY (3) 

1.40 (3) 

0.53 (3) 

cheA (3) 

0.04 

0.01 

cheW (3) 

Below detection level 

0.04 

cheR (2) 

0.09 

0.25 


(b) 

RSP2433-2443 (WT) 

cheY (3) 

0.92 (3) 

cheA (3) 

0.34 

cheW (3) 

0.13 

cheR (2) 

0.30 
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6 EXPRESSION LEVELS OF GENES CODING FOR TWO-COMPONENT SYSTEMS IN E. COL! 


6 Expression levels of genes coding for two-component systems in E. coli 

The genome of E. coli eomprises 29 sensor histidine kinases and 32 response regulators involved in 
two-eomponent signaling systems (1121 l99h . whieh are involved in the eelTs sensing and response to its 
environment. It is interesting to eompare the expression levels of the ehemotaxis genes to those of other 
two-eomponent signaling systems. In Table [S9l we present the number of protein eopies per eell for the 
histidine kinases and response regulators of various two-eomponent systems in E. coli. The variability 
aeross estimates from different studies may be explained by the differenees in media, growth p hases, and 
strains of E. coli, whieh are known to yield signifieant variations of abundanees (llbLIlOOLIlOlh . and also 
perhaps by the different teehniques used: 


In Ref. (l59ll . the proteome of E. coli was quantified at the single-eell level using single-moleeule 
fluoreseenee, thanks to a yellow-fluoreseent-protein fusion library. E. coli BW25993 eells were 
grown in LB media and then inoeulated into M9 media supplemented with glueose, amino aeids, 
and vitamins. The optieal density was 0.1-0.5. 


Ref. (llOll) used mass speetrometry (more preeisely, emPAI) to quantify the abundanee of proteins 
in E. coli, foeusing mostly on eytoplasmie proteins. E. coli MC4100 eells were grown in rieh or 
minimum medium to exponential phase (optieal density ~0.4), but the datasets obtained with the 
two different media were eombined in the final analysis. 


Ref. (Il02h used the emPAI teehnique as Ref. (llOlh . but this work also quantified the abundanees 
of membrane proteins, first extraeting them with the aid of a removable phase transfer surfae- 
tant (PTS). In this referenee, E. coli BW251I3 cells were grown in LB medium and harvested at 
stationary phase. 

Ref. ( 113 ) used quantitative immunoblotting, focusing only on ehemotaxis proteins. Several con¬ 
ditions were studied. Here we report the values obtained for strain RP437 in Tryptone broth (rich 
medium) with cells grown to an optical density of 0.5. 

Ref. ( lOOl) focused on the levels of EnvZ and OmpR proteins using quantitative Western blot 
analysis. The E. coli strain used was MC4100. The values reported here were obtained during 
exponential growth in L-broth medium. 
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Table S9: Cellular abundances of the proteins from various two-component systems in E. coli. Data from 
several proteomic studies (see also ‘http://ecoliwiki.net/colipedia’). All values are in copy numbers per 


cell. 


Two-component system 
(Histidine kinase / response regulator) 

Histidine kinase 
(^ (T^ 

Other 

Response regulator 
(^ (T^ (102) 

Other 

CheA/ CheY: Chemotaxis 




6,700 (1^ 

12 



8,200 (1© 

EnvZ / OmpR: Osmolarity sensing 



1.3 

100(100) 

81 

613 

238 

3,500 (100) 

NarX / NarL: Response to nitrite 



1.3 



229 

522 


PhoR / PhoB: Phosphate regulation 

11 








EvgS / EvgA: Drug resistance 

5 

82 




198 

26 


CusS / CusR: Copper response 

2 


1.4 






YedV / YedW 

3 


0.8 


1.5 




KdpD / KdpE; Potassium transport 

9 


6/0.7 


6 




BaeS / BaeR 





12 

167 



HydH / HydG 

1 








PhoQ / PhoP: Response to magnesium 

7 


1.2 



786 

113 


BasS / BasR: Polymyxin resistance 

0.7 


1.3 


65 




CpxA / CpxR: Response to cell envelope stress 



1.8 


33 

664 

316 


TorS / TorR 

3 








DcuS / DcuR 



1.1 


0.6 




RcsC / RcsB: Capsular synthesis 

9 


1.1 


369 

1,490 

597 


CitA / CitB 





1 




ArcB / ArcA: Respiratory control 

56 

100 

32/1.5 



2,660 

550 


BarA / UvrY; Hydrogen peroxide sensitivity 

2 


1.3 


29 


18 



7 Cellular localization of the protein FMM 

The protein FliM is a constituent of the cytoplasmic ring of the rotor that mediates rotation-direction 
switching in response to binding of CheY-P. Several studies reveal that only a small fraction of FliM is 
part of complete motors (Table [STO]): 


In Ref. (l27h . the relative abundances of FliM in the membrane and cytoplasmic fractions were 
estimated by lysing the cells and separating the membranes from the cytoplasm by centrifugation. 
It was found that ~1,100 copies of FliM out of 1,400, i.e., ~78% of FliM copies, were in the 
cytoplasmic fraction, and thus not in complete motors. 


In Ref. (1281) . the abundance and localization of FliM were studied by fluorescence microscopy. 
There were 24 zh 6 spots, corresponding to assemblies of multiple FliM, per cell. The distribution 
of the number of molecules per spot showed two peaks, one at 32 molecules (in agreement with 
previously measured numbers of FliM per flagellar motor) accounting for about 40% of the spots, 
and one at 18 molecules, which may correspond to partly assembled cytoplasmic rings, accounting 
for about 45% of the spots (|28b . The remaining 15% of the spots fell outside these peaks, corre¬ 
sponding to structures not independently resolved. Hence, 40 — 55% of the FliM in spots were part 
of full cytoplasmic rings. In addition, background fluorescence showed the presence of 630 ± 290 
FliM molecules not associated with spots, and the total number of FliM copies per cell was esti¬ 
mated to be 1,450 ± 360. These data yield a fraction from 0.4 x (1450 — 630)/1450 = 0.23 to 
0.55 X (1450 —630)/1450 = 0.31 of FliM copies that may actually be inside full cytoplasmic rings. 
In other words, 69 — 77% of FliM copies were outside complete motors. Besides, a fraction from 
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0.45 X (1450 - 630)/1450 = 0.25 to 0.60 x (1450 - 630)/1450 = 0.34 of FliM copies appeared 
to belong to partly assembled rings. 

• In Ref. Q (which studied S. typhimurium while the other referenees studied E. coli), whole-cell 
lysates were separated into three fraetions by sedimentation: 16% ± 3% of FliM was found to 
be in the soluble fraetion and was thus not part of motors. Gel filtration enabled separation of 
the larger basal body structures into full flagellar motors and incomplete precursors. Among these 
large structures, about 31% of the FliM was found to be part of precursors. At the end of their 
analysis, the authors also stated that “about half of FliM remained unaeeounted for [in the gel fil¬ 
tration results], suggesting that FliM may form presently uneharaeterized, partieulate aggregates 
in addition to being part of flagellar basal bodies”. They suggested that those eorrespond to “dis- 
soeiable FliM assembly intermediates that either get stuek on or elute very late from the eolumn”, 
thus not appearing in the gel filtration results. Combining this, we may estimate that a fraetion 
0.5 -1-0.16-1- 0.31 X (1 — 0.5 — 0.16) = 0.76 of FliM eopies is not part of eomplete motors. 


Table SIO: Number of FliM eopies per eell and partition. 


Organism 

Total per eell 

Outside complete motors 

In partly-assembled structures 

Ref. 

E. coli 

1,400 ± 200 

78% 

Not evaluated 

(ZD 

E. coli 

1,450 ± 360 

69-77% 

25-34% 

Q8) 

S. typhimurium 

1,640 ± 300 

76% 

31-81% 

(31) 


We ean also estimate the number of FliM that aetually belong to funetional flagella. In wild-type E. 



31) reports about 3 ± 1.5 flagella per 
Studies of the stoiehiometry of the 


coli. Ref. (1271) reports on average 2.6 flagella per eell, and Ref, 
cell. In S. typhimurium, there are about 6-10 flagella per eell 

motor report 37 ± 13 FliM eopies per motor (ll3L l3lb (in S. typhimurium), and about 32 (l28l) (in E. coli). 
This would give a number of order 100 FliM that actually belong to funetional flagella in E. coli (about 
7%), and about 300 in S. typhimurium (about 18%). This is even less than what would be expected from 
the results above (Table ISTOl) . especially for E. coli. This difference may indicate that some eomplete or 
almost eomplete eytoplasmic rings or motors do not belong to flagella. These might be in the last stages 
of assembly. 

In spite of the high fraetion of FliM that are not in funetional flagellar motors, the number of flagella 
per eell inereases when FliM is (not too hig hly) overexpressed, whieh indieates that FliM eonstitutes a 
limiting resouree in flagellar assembly (l27h . Besides, unde rexp ression of FliM reduees the number of 
flagella per eells and their effieieney. Consistently, in Ref. (12 ill) , the number of FliM per eell neeessary 
for optimum motility was about 4900 (about 3.5 times higher than in the wild-type cells), and only about 
20 — 30% of these FliM were found in functional or incomplete flagellar motors (note that these FliM 
were fluoreseently labeled). 

Reeent work indieates that the FliM proteins present in r noto r s exc hange with the eytoplasmic pool, 
and that the number of FliM per motor is vari able (28, 13, lOdl lOSb . The number of FliM copi es pe r 
motor depends on the eoneentration of CheY-P (llOdb . through the direetion of rotation of the motor dlOSh . 
This allows for adaptation of the motor to the eoneentration of Ch eY-R by shifting the range of CheY-P 
eoneentration over whieh the eloekwise bias of the motor ehanges (1 104b . whi eh is very narrow (l3.5b . The 
fraetion of FliM that exehanges depends on the direetion of rotation too ( lOSb . Henee, eytoplasmie FliM 
seems to have a funetion in motor adaptation. 
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